04:00
2026-06-17
arxiv.org
natural-language-processing
Examining the Limits of Word2Vec with Toki Pona
Researchers tested Word2Vec on Toki Pona, a constructed language with only ~130 words, using 1.4 million sentences. They found that non-core tokens like loanwords improved embedding quality by drawingβ¦